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GENERAL INFORMATION : 
(i) APPLICANT: 



Bahi]a Jallal 
Gregory D . Plowman 



(ii) TITLE OF INVENTION: 
(lii) NUMBER OF SEQUENCES: 



DIAGNOSIS AND TREATMENT OF 
PTP04 RELATED DISORDERS 

18 



(iv) CORRESPONDENCE ADDRESS : 



(A) ADDRESSEE: Lyon & Lyon 

(B) STREET: 633 West Fifth Street 

Suite 4700 

(C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
FastSEQ for Windows 2.0 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/047..222 

(B) FILING DATE: May 20, 1997 



(viii) ATTORNEY /A.GENT INFORMATION: 
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(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER : 



Warburg, Richard J. 

32 , 327 

234/253 



!ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 489-1600 

(B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCCGGGTGCC CTCCCTCAAC CTACTTATAG ACTATTTTTC TTGCTCTGCA GCATGGACCA 6Q 

AAGAGAAATT CTGCAGAAGT TCCTGGATGA GGCCCAAAGC AAGAAAATTA CTAAAGAGGA 12 0 

GTTTGCCAAT GAATTT CTGA AGCTGAAAAG GCAATCTACC AAGTACAAGG CAGACAAAAC 18 0 

CTATCCTACA ACTGTGGCTG AGAAGCCCAA GAAT AT CAAG AAAAACAGAT ATAAGGATAT 24 0 

TTTGCCCTAT GATTATAGCC GGGTAGAACT AT CCCTGATA ACCTCTGATG AGGATT C CAG 30 0 

CTA CAT GAAT GCCAACTTCA TTAAGGGAGT TTATGGACCC AAGGCTTATA TTGCCACCCA 360 

GGGTCCTTTA TCTACAACCC TCCTGGACTT CTGGAGGATG ATTTGGGAAT ATAGTGTCCT 42 0 

TAT CATTGTT ATGGCATGCA TGGAGTATGA AATGGGAAAG AAAAAGTGTG AGCGCTACTG 48 0 

GGCTGAGCCA GGAGAGATGC AGCTGGAATT TGGCCCTTTC TCTGTATCCT GTGAAGCTGA 54 0 

AAAAAGGAAA TCTGATTATA TAATCAGGAC TCTAAAAGTT AAGTT CAATA GTGAAACTCG 60 0 

AACTAT CT AC CAGTTTCATT ACAAGAATTG GCCAGACCAT GATGTACCTT CATCTATAGA 660 

CCCTATTCTT GAG CT CAT CT GGGATGTACG TTGTTACCAA GAGGATGACA GTGTTCCCAT 72 0 

ATGCATTCAC TGCAGTGCTG GCTGTGGAAG GACTGGTGTT ATTTGTGCTA T TG ATT AT A C 780 

ATGGATGTTG CT AAAAG AT G GGATAATTCC TGAGAACTTC AGTGTTTTCA GTTTGATCCG 84 0 

GGAAATGCGG ACACAGAGGC CTT CATTAGT T CAAACG CAG GAACAATATG AACTGGTCTA 90 0 

CAATGCTGTA TTAGAACTAT TTAAGAGACA GATGGATGTT AT CAGAGATA AACATT CTGG 96 0 

AA CAG AG AG T CAAGCAAAGC ATTGTATTCC TGAGAAAAAT CACACTCTCC AAGCAGACTC 10 2 0 

TTATTCTCCT AATTTACCAA AAAGTAC CAC AAAAG CAG CA AAAAT G AT G A ACCAACAAAG 10 3 0 

GACAAAAATG GAAATCAAAG AATCTTCTTC CTTTGACTTT AGGACTT CTG AAATAAGTGC 114 0 

AAAAGAAGAG CTAGTTTTGC ACCCTGCTAA AT CAAG CA CT TCTTTTGACT TTCTGGAGCT 12 0 0 

AAATTACAGT TTTGACAAAA AT G CTGA CAC AACCATGAAA TGGCAGACAA AGGCATTTCC 12 6 0 

AATAGTTGGG GAGCCTCTTC AG AAG CAT CA AAGTTTGGAT TTGGGCTCTC TTTTGTTTGA 13 2 0 

GGGATGTTCT AATTCTAAAC CTGTAAATGC AGCAGGAAGA TATTTTAATT CAAAGGTGCC 13 8 0 

AATAACACGG ACCAAATCAA CTCCTTTTGA AT T GAT A CAG CAGAGAGAAA CCAAGGAGGT 144 0 
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GGACAGCAAG GAAAACTTTT CTTATTTGGA ATCTCAACCA CATGATTCTT GTTTTGTAGA 
GATGCAGGCT CAAAAAGTAA TGCATGTTTC TTCAGCAGAA CTGAATTATT CACTGCCATA 
TGACTCTAAA CACCAAATAC GTAATGCCTC TAATGTAAAG CACCATGACT CTAGTGCTCT 
TGGTGTATAT TCTTACATAC CTTTAGTGGA AAATCCTTAT TTTTCATCAT GGCCTCCAAG 
TGGTACCAGT TCTAAGATGT CTCTTGATTT ACCTGAGAAG CAAGATGGAA CTGTTTTTCC 
TTCTTCTCTG TTGCCAACAT CCTCTACATC CCTCTTCTCT TATTACAATT CACATGATTC 
TTTATCACTG AATTCTCCAA CCAATATTTC CTCACTATTG AACCAGGAGT CAGCTGTACT 
AGCAACTGCT CCAAGGATAG ATGATGAAAT CCCCCCTCCA CTTCCTGTAC GGACACCTGA 
ATCATTTATT GTGGTTGAGG AAG CTGGAGA ATTCTCACCA AATGTTCCCA AATCCTTATC 
CTCAGCTGTG AAGGTAAAAA TTGGAACATC ACTGGAATGG GGTGGAACAT CTGAACCAAA 
GAAATTTGAT GACTCTGTGA TACTTAGACC AAG CAAGAGT GTAAAACTCC GAAGTCCTAA 
AT CAGAACTA CAT CAAGAT C GTTCTTCTCC CCCACCTCCT CTCCCAGAAA GAACTCTAGA 
GTCCTTCTTT CTTGCCGATG AAGATTGTAT GCAGGCCCAA T CT ATAG AAA GATATTCTAC 
TAGCTATCCT GACACCATGG AAAATTCAAC ATCTTCAAAA CAGACACTGA AGACTCCTGG 
AAAAAGTTTC ACAAGGAGTA AGAGTTTGAA AATTTTGCGA AACATGAAAA AGAGTAT CTG 
TAATTCTTGC CCACCAAACA AGCCTGCAGA ATCTGTTCAG TCAAATAACT CCAGCTCATT 
TCTGAATTTT GGTTTTGCAA ACCGTTTTTC AAAACCCAAA GGACCAAGGA ATCCACCACC 
AACTTGGAAT ATTTAATAAA ACTCCAGATT TATAATAATA TGGGCTGCAA GTACACCTGC 
AAATAAAACT ACTAGAATAC TGCTAGTTAA AATAAGTGCT CTATATGCAT AAT AT CAAAT 
ATGAAGATAT G CT AATGTGT TAATAGCTTT TAAAAGAAAA GCAAAATGCC AATAAGTGCC 
AGTTTTGCAT TTTCATATCA TTTGCATTGA GTTGAAAACT GCAAATAAAA GTTTGTCACT 
TGAG CTTATG TACAGAATGC TATATGAGAA ACACTTTTAG AATGGATTTA TTTTTCATTT 
TTGCCAGTTA TTTTTATTTT CTTTTACTTT TTTACATAAA CATAAACTTC AAAAGGTTTG 
TAAGATTTGG AT CT CAACT A ATTTCTACAT TGCCAGAATA TACTATAAAA AGTTAAAAAA 
AAACTTACTT TGTGGGTTGC AATACAAACT GCTCTTGACA ATGACTATTC CCTGACAGTT 
ATTTTTGCCT AAATGGAGTA TACCTTGTAA ATCTTCCCAA ATGTTGTGGA AAACTGGAAT 
ATTAAGAAAA TGAGAAATTA TATTTATTAG AATAAAATGT G CAAAT AATG ACAATTATTT 
GAATGTAACA AGGAATT CAA CTGAAAT C CT GATAAGTTTT AACCAAAGTC ATTAAATTAC 
CAATTCTAGA AAAGTAATCA ATGAAATATA ATAGCTATCT TTTGGTAGCA AAAGATATAA 
ATTGTATATG TTTATACAGG ATCTTTCAGA TCATGTGCAA TTTTTATCTA ACCAATCAGA 
AATACTAGTT TAAAATGAAT TTCTATATGA ATATGGAT CT GCCATAAGAA AAT CTAGTT C 
AACTCTAATT TTATGTAGTA AATAAATTGG CAGGTAATTG TTTTTACAAA GAATCCACCT 
GACTTCCCCT AATG CAT TAA AAATATTTTT ATTTAAATAA CTTTATTTAT AACTTTTAGA 
AACATGTAGT ATTGTTTAAA CATCATTTGT TCTTCAGTAT TTTTCATTTG GAAGTCCAAT 
AGGGCAAATT GAATGAAGTA TTATTAT CTG TCTCTTGTAG TACAATGTAT CCAACAGACA 
CT CAA T AAA C TTTTTGGTTG TTAAAAAAAA AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO : 2: 
(l) SEQUENCE CHARACTERISTICS: 



1500 
1560 
162Q 
1680 
1740 
1800 
1860 
192 0 
1980 
2040 
2100 
2160 

2 2 2- 
2230 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 

3 0 0 0 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3580 



(A) 
(B) 
(C) 
( D ) 



TYPE : 

STRANDEDNESS : 
TOPOLOGY: 



LENGTH : 



8 0 7 amino acids 
amino acid 
single 
1 inear 



MOLECULE TYPE : 



peptide 
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Met Asp Gin Arg Glu lie Leu Gin Lys Phe Leu Asp Glu Ala Gin Ser 

Lys Lys lie Thr Lys Glu Glu Phe Ala Asn Glu Phe Leu Lys Leu Lys 
20 25 30 



Arg Gin Ser Thr Lys Tyr Lys Ala Asp Lys Thr Tyr Pro Thr Thr Val 
35 40 45 

Ala Glu Lys Pro Lys Asn lie Lys Lys Asn Arg Tyr Lys Asp lie Leu 
50 55 60 

Pro Tyr Asp Tyr Ser Arg Val Glu Leu Ser Leu lie Thr Ser Asp Glu 
65 70 75 80 

Asp Ser Ser Tyr lie Asn Ala Asn Phe He Lys Gly Val Tyr Gly Pro 
85 90 95 

Lys Ala Tyr He Ala Thr Gin Gly Pro Leu Ser Thr Thr Leu Leu Asp 
100 105 HO 

Phe Trp Arg Met He Trp Glu Tyr Ser Val Leu He He Val Met Ala 
115 120 125 

Cys Met Glu Tyr Glu Met Gly Lys Lys Lys Cys Glu Arg Tyr Trp Ala 
130 135 140 

Glu Pro Gly Glu Met Gin Leu Glu Phe Gly Pro Phe Ser Val Ser Cys 
145 150 155 160 

Glu Ala Glu Lys Arg Lys Ser Asp Tyr He He Arg Thr Leu Lys Val 
165 170 175 

Lys Phe Asn Ser Glu Thr Arg Thr He Tyr Gin Phe His Tyr Lys Asn 
180 185 190 

Trp Pro Asp His Asp Val Pro Ser Ser He Asp Pro He Leu Glu Leu 
195 200 205 

He Trp Asp Val Arg Cys Tyr Gin Glu Asp Asp Ser Val Pro He Cys 
210 215 220 

He His Cys Ser Ala Gly Cys Gly Arg Thr Gly Val He Cys Ala He 
225 230 235 240 

Asn T Tr r Thr Trp Met Leu Leu Lys Asp Gly He He Pro Glu Asn Phe 
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245 250 255 

Ser Val Phe Ser Leu lie Arg Glu Met Arg Thr Gin Arg Pro Ser Leu 
260 265 270 

Val Gin Thr Gin Glu Gin Tyr Glu Leu Val Tyr Asn Ala Val Leu Glu 
275 280 285 

Leu Phe Lys Arg Gin Met Asp Val lie Arg Asp Lys His Ser Gly Thr 
290 295 300 



Glu Ser Gin Ala Lys His Cys lie Pro Glu Lys Asn His Thr Leu Gin 
305 310 315 320 

Ala Asp Ser Tyr Ser Pro Asn Leu Pro Lys Ser Thr Thr Lys Ala Ala 
325 330 335 

Lys Met Met Asn Gin Gin Arg Thr Lys Met Glu He Lys Glu Ser Ser 
340 345 350 

Ser Phe Asp Phe Arg Thr Ser Glu He Ser Ala Lys Glu Glu Leu Val 
355 360 365 

Leu His Pro Ala Lys Ser Ser Thr Ser Phe Asp Phe Leu Glu Leu Asn 

370 375 380 

Tyr Ser Phe Asp Lys Asn Ala Asp Thr Thr Met Lys Trp Gin Thr Lys 
385 390 395 400 

Ala Phe Pro He Val Gly Glu Pro Leu Gin Lys His Gin Ser Leu Asp 
405 410 415 

Leu Gly Ser Leu Leu Phe Glu Gly Cys Ser Asn Ser Lys Pro Val Asn 
420 425 430 

Ala Ala Gly Arg Tyr Phe Asn Ser Lys Val Pro He Thr Arg Thr Lys 
435 440 445 

Ser Thr Pro Phe Glu Leu He Gin Gin Arg Glu Thr Lys Glu Val Asp 
450 455 460 

Ser Lys Glu Asn Phe Ser Tyr Leu Glu Ser Gin Pro His Asp Ser Cys 
465 470 475 480 

Phe Val Glu Met Gin Ala Gin Lys Val Met His Val Ser Ser Ala Glu 
485 490 495 

Leu Asn Tyr Ser Leu Pro Tyr Asp Ser Lys His Gin He Arg Asn Ala 
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500 505 510 

Ser Asn Val Lys His His Asp Ser Ser Ala Leu Gly Val Tyr Ser Tyr 
515 520 525 

lie Pro Leu Val Glu Asn Pro Tyr Phe Ser Ser Trp Pro Pro Ser Gly 
530 535 540 

Thr Ser Ser Lys Met Ser Leu Asp Leu Pro Glu Lys Gin Asp Gly Thr 
545 550 555 560 

Val Phe Pro Ser Ser Leu Leu Pro Thr Ser Ser Thr Ser Leu Phe Ser 



Tyr Tyr Asn Ser His Asp Ser Leu Ser Leu Asn Ser Pro Thr Asn lie 
530 585 590 



Ser Ser Leu Leu Asn Gin Glu Ser Ala Val Leu Ala Thr Ala Pro Arg 
595 600 605 

lie Asp Asp Glu lie Pro Pro Pro Leu Pro Val Arg Thr Pro Glu Ser 
610 615 620 

Phe lie Val Val Glu Glu Ala Gly Glu Phe Ser Pro Asn Val Pro Lys 
625 630 635 640 

Ser Leu Ser Ser Ala Val Lys Val Lys lie Gly Thr Ser Leu Glu Trp 
645 650 655 

Gly Gly Thr Ser Glu Pro Lys Lys Phe Asp Asp Ser Val lie Leu Arg 
660 665 670 

Pro Ser Lys Ser Val Lys Leu Arg Ser Pro Lys Ser Glu Leu His Gin 
675 680 685 

Asp Arg Ser Ser Pro Pro Pro Pro Leu Pro Glu Arg Thr Leu Glu Ser 
690 695 700 

Phe Phe Leu Ala Asp Glu Asp Cys Met Gin Ala Gin Ser lie Glu Thr 
705 710 715 720 

Tyr Ser Thr Ser Tyr Pro Asp Thr Met Glu Asn Ser Thr Ser Ser Lys 
725 730 735 

Gin Thr Leu Lys Thr Pro Gly Lys Ser Phe Thr Arg Ser Lys Ser Leu 
740 745 750 
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Lys lie Leu Arg Asn Met Lys Lys Ser lie Cys Asn Ser Cys Pro Pro 

755 760 765 

Asn Lys Pro Ala Glu Ser Val Gin Ser Asn Asn Ser Ser Ser Phe Leu 

770 775 780 

Asn Phe Gly Phe Ala Asn Arg Phe Ser Lys Pro Lys Gly Pro Arg Asn 

78 5 7 90 795 800 



Pro Pro Pro Thr Trp Asn lie 
805 



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
C D ) TOPOLOGY : linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: The letter "Y" stands for C or T. 

The letter "V" stands for A, C or 
G. 

The letter "R" stands for A or G . 
The letter " N" stands for A, C, G 
or T . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAYTTYTGGV RNATGRTNTG GGA 2 3 



(2) INFORMATION FOR SEQ ID NO : 4: 
U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
( ix) FEATURE : 

(D) OTHER INFORMATION: The letter "S" stands for C or G. 

The letter "Y" stands for C or T. 
The letter " N" stands for A, C, G 
or T. 

The letter "W" stands for A or T . 
The letter "R" stands for A or G. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

7GGCCSAYNC CNGCNSWRCA RTG 2 3 



(2) INFORMATION FOR SEQ ID NO : 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE : 

(D) OTHER INFORMATION: "Xaa" in positions 4 and 6 stand 

for an unspecified amino acid . 
"Xaa" in position 8 stands for 
either Glu or Asp. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Phe Trp Xaa Met Xaa Trp Xaa 

1 5 



(2) INFORMATION FOR SEQ ID NO: 
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( i ) S E QUE N CE CKARACT ERI STI LS : 

(A) LENGTH : 

(B) TYPE: 

(C) STRANDEDNESS 

(D) TOPOLOGY: 

(11) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: "Xaa" in positions 3 and 6 stan 

for an unspecified amino acid. 

(Xl ) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

His Cys Xaa Ala Gly Xaa Gly 
1 5 



7 amino acids 
amino acid 
single 
linear 



(2) INFORMATION FOR SEQ ID NO : 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
CACCGTTCGA GTATTT CAGA TTGTGAAGAA GTCC 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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GGACTTCTTC ACAATCTGAA ATACTCGAAC GGTG 3 4 

(2) INFORMATION FOR SEQ ID NO : 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCGTTATGTG AGGAAGAGCC ACATTACAGG ACC 33 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTCCTGTAA TGTGGCTCTT CCTCACATAA CGG 



(2) INFORMATION FOR SEQ ID NO : 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCATGCATG GAGTATGAAA TGG 23 
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'2} INFORMATION FOR SEQ ID NO: 12: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGTACATCCC AGATGAGCTC AAGAATAGGG 



2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser 
15 10 



(2) INFORMATION FOR SEQ ID NO: 14: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

His Cys Ser Ala Gly 
1 5 
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( 2 ^ INFORMATION FOR SEQ ID MO: 15: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3 1 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser Trp Pro Pro Ser Gly Thr Ser Ser Lys Met Ser Leu Asp Asp Leu 
15 10 15 

Pro Glu Lys Gin Asp Gly Thr Val Phe Pro Ser Ser Leu Leu Pro 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 16: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Tyr Ser Leu Pro Tyr Asp Ser Lys His Gin He Arg Asn Ala Ser Asn 
15 10 15 

Val Lys His His Asp Ser Ser Ala Leu Gly Val Tyr Ser Tyr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 17: 
(l) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDMESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Thr Leu Gin Ala Asp Ser Tyr Ser Pro Asn Leu Pro Lys Ser Thr 

15 10 15 



iHr uy t> i. a nid Liya i'itsu 

20 25 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 

Met Asp Gin Arg Glu lie Leu Gin Gin Leu Leu Lys Glu Ala Gin Lys 
15 10 15 

Lys Lys Leu Asn Ser Glu Glu Phe Ala Ser Glu Phe Leu Lys Leu Lys 
20 25 30 

Arg Gin Ser Thr Lys Tyr Lys Ala Asp Lys lie Tyr Pro Thr Thr Val 
35 40 45 

Ala Gin Arg Pro Lys Asn lie Lys Lys Asn Arg Tyr Lys Asp lie Leu 
50 55 60 

Pro Tyr Asp His Ser Leu Val Glu Leu Ser Leu Leu Thr Ser Asp Glu 
65 70 75 80 

Asp Ser Ser Tyr lie Asn Ala Ser Phe He Lys Gly Val Tyr Gly Pro 
85 90 95 
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Lys Ala Tyr He Ala Thr Gin Gly Pro Leu Ser Thr Thr Leu Leu Asp 
100 105 110 

Phe Trp Arg Met He Trp Glu Tyr Arg lie Leu Val He Val Met Ala 
115 120 125 

Cys Met Glu Phe Glu Met Gly Lys Lys Lys Cys Glu Arg Tyr Trp Ala 

XJU J. J) -J j--±w 

Glu Pro Gly Glu Thr Gin Leu Gin Phe Gly Pro Phe Ser He Ser Cys 
145 150 155 160 

Glu Ala Glu Lys Lys Lys Ser Asp Tyr Lys He Arg Thr Leu Lys A. 1 a 
165 170 175 

Lys Phe Asn Asn Glu Thr Arg He He Tyr Gin Phe His Tyr Lys Asn 
180 185 190 

Trp Pro Asp His Asp Val Pro Ser Ser He Asp Pro He Leu Gin Leu 
195 200 205 

He Trp Asp Met Arg Cys Tyr Gin Glu Asp Asp Cys Val Pro He Cys 
210 215 220 

He His Cys Ser Ala Gly Cys Gly Arg Thr Gly Val He Cys Ala Val 
225 230 235 240 

Asp Tyr Thr Trp Met Leu Leu Lys Asp Gly He He Pro Lys Asn Phe 
245 250 255 

Ser Val Phe Asn Leu He Gin Glu Met Arg Thr Gin Arg Pro Ser Leu 
260 265 270 

Val Gin Thr Gin Glu Gin Tyr Glu Leu Val Tyr Ser Ala Val Leu Glu 
275 280 285 

Leu Phe Lys Arg His Met Asp Val He Ser Asp Asn His Leu Gly Arg 

290 295 300 



Glu He Gin Ala Gin Cys Ser He Pro Glu Gin Ser Leu Thr Val Glu 
305 310 315 320 

Ala Asp Ser Cys Pro Leu Asp Leu Pro Lys Asn Ala Met Arg Asp Val 
325 330 335 

Lys Thr Thr Asn Gin His Ser Lys Gin Gly Ala Glu Ala Glu Ser Thr 
340 345 350 
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Gly Gly Ser Ser Leu Gly Leu Arg Thr Ser Thr Met Asn Ala Glu Glu 
355 360 365 

Glu Leu Val Leu Kis Ser Ala Lys Ser Ser Pro Ser Phe Asn Cys Leu 
370 375 380 

Glu Leu Asn Cys Gly Cys Asn Asn Lys Ala Val lie Thr Arg Asn Gly 

385 390 395 400 

Gin Ala Arg Ala Ser Pro Val Val Gly Glu Pro Leu Gin Lys Tyr Gin 
405 410 415 

Ser Leu Asp Phe Gly Ser Met Leu Phe Gly Ser Cys Pro Ser Ala Leu 
420 425 430 

Pro lie Asn Thr Ala Asp Arg Tyr His Asn Ser Lys Gly Pro Val Lys 
435 440 445 

Arg Thr Lys Ser Thr Pro Phe Glu Leu lie Gin Gin Arg Lys Thr Asn 
450 455 460 

Asp Leu Ala Val Gly Asp Gly Phe Ser Cys Leu Glu Ser Gin Leu His 
465 470 475 480 

Glu His Tyr Ser Leu Arg Glu Leu Gin Val Gin Arg Val Ala His Val 
485 490 495 

Ser Ser Glu Glu Leu Asn Tyr Ser Leu Pro Gly Ala Cys Asp Ala Ser 
500 505 510 

Cys Val Pro Arg His Ser Pro Gly Ala Leu Arg Val His Leu Tyr Thr 
515 520 525 

Ser Leu Ala Glu Asp Pro Tyr Phe Ser Ser Ser Pro Pro Asn Ser Ala 
530 535 540 

Asp Ser Lys Met Ser Phe Asp Leu Pro Glu Lys Gin Asp Gly Ala Thr 

545 550 555 560 



Ser Pro Gly Ala Leu Leu Pro Ala Ser Ser Thr Thr Ser Phe Phe Tyr 

565 570 575 

Ser Asn Pro His Asp Ser Leu Val Met Asn Thr Leu Thr Ser Phe Ser 
530 585 590 

Pro Pro Leu Asn Gin Glu Thr Ala Val Glu Ala Pro Ser Arg Arg Thr 
595 600 605 
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Asp Asp Glu lie Pro Pro Pro Leu Pro Glu Arg Thr Pro Glu Ser Phe 
610 615 620 

lie Val Val Glu Glu Ala Gly Glu Pro Ser Pro Arg Val Thr Glu Ser 
625 630 635 640 

Leu Pro Leu Val Val Thr Phe Gly Ala Ser Pro Glu Cys Ser Gly Thr 
645 650 655 

Ser Glu Met Lys Ser His Asp Ser Val Gly Phe Thr Pro Ser Lys Asn 
660 665 670 

Val Lys Leu Arg Ser Pro Lys Ser Asp Arg His Gin Asp Gly Ser Pro 
675 680 685 

Pro Pro Pro Leu Pro Glu Arg Thr Leu Glu Ser Phe Phe Leu Ala Asp 
690 695 700 

Glu Asp Cys lie Gin Ala Gin Ala Val Gin Thr Ser Ser Thr Ser Tyr 
705 710 715 720 

Pro Glu Thr Thr Glu Asn Ser Thr Ser Ser Lys Gin Thr Leu Arg Thr 
725 730 735 

Pro Gly Lys Ser Phe Thr Arg Ser Lys Ser Leu Lys lie Phe Arg Asn 
740 745 750 

Met Lys Lys Ser Val Cys Asn Ser Ser Ser Pro Ser Lys Pro Thr Glu 
755 760 765 

Arg Val Gin Pro Lys Asn Ser Ser Ser Phe Leu Asn Phe Gly Phe Gly 
770 775 780 

Asn Arg Phe Ser Lys Pro Lys Gly Pro Arg Asn Pro Pro Ser Ala Trp 
785 790 795 800 

Asn Met 
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